Language Model Tokenizers Introduce Unfairness Between Languages

Neural Information Processing Systems

Recent language models have shown impressive multilingual performance, even when not explicitly trained for it. Despite this, there are concerns about the quality of their outputs across different languages. In this paper, we show how disparity in the treatment of different languages arises at the tokenization stage, well before a model is even invoked. The same text translated into different languages can have drastically different tokenization lengths, with differences of up to 15 times in some cases. These disparities persist even for tokenizers that are intentionally trained for multilingual support.
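
The disparity is easy to observe directly. Below is a minimal sketch using the Hugging Face transformers tokenizer API; the checkpoint name and the example sentences (rough translations of the same greeting) are illustrative assumptions, not the paper's exact experimental setup.

```python
# Minimal sketch: compare token counts for the same sentence across languages.
# The checkpoint and the translations are illustrative assumptions, not the
# paper's exact setup.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")  # any subword tokenizer works

samples = {
    "English": "Hello, how are you today?",
    "Spanish": "Hola, ¿cómo estás hoy?",
    "Hindi":   "नमस्ते, आज आप कैसे हैं?",  # rough translation
}

for lang, text in samples.items():
    n_tokens = len(tokenizer.encode(text))
    print(f"{lang:8s} {n_tokens:3d} tokens for {len(text)} characters")
```

With a byte-level BPE vocabulary trained largely on English text, the Devanagari sentence typically falls back to many byte-level tokens, so its token count can be several times the English one; that gap is the kind of disparity the paper quantifies.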


The drones being used in Sudan: 1,000 attacks since April 2023

Al Jazeera

During Sudan's civil war, which erupted in April 2023, both sides have increasingly relied on drones, and civilians have borne the brunt of the carnage. The conflict between the Sudanese armed forces (SAF) and the Rapid Support Forces (RSF) paramilitary group is an example of war transformed by commercially available, easily concealable unmanned aerial vehicles (UAVs), or drones. Modular, well-adapted to sanctions evasion and devastatingly effective, drones have killed scores of civilians, crippled infrastructure and plunged Sudanese cities into darkness. In this visual investigation, Al Jazeera examines the history of drone warfare in Sudan, the types of drones used by the warring sides, how they are sourced, where the attacks have occurred and the human toll. The RSF traces its origins to what at the time was a government-linked militia known as the Janjaweed.


All the countries Israel attacked in 2025: Animated map

Al Jazeera

How many countries has Israel attacked in 2025? Israel has attacked more countries than any other country this year. In 2025, Israel attacked at least six countries, including Palestine, Iran, Lebanon, Qatar, Syria, and Yemen. It also carried out strikes on aid flotillas heading for Gaza in Tunisian, Maltese and Greek territorial waters.


On Global Applicability and Location Transferability of Generative Deep Learning Models for Precipitation Downscaling

Harder, Paula, Lessig, Christian, Chantry, Matthew, Pelletier, Francis, Rolnick, David

arXiv.org Artificial Intelligence

Deep learning offers promising capabilities for the statistical downscaling of climate and weather forecasts, with generative approaches showing particular success in capturing fine-scale precipitation patterns. However, most existing models are region-specific, and their ability to generalize to unseen geographic areas remains largely unexplored. In this study, we evaluate the generalization performance of generative downscaling models across diverse regions. Using a global framework, we employ ERA5 reanalysis data as predictors and IMERG precipitation estimates at $0.1^\circ$ resolution as targets. A hierarchical location-based data split enables a systematic assessment of model performance across 15 regions around the world.
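
As a rough illustration of what a location-based split looks like in practice, the sketch below holds out entire regions so that test locations are never seen during training. The region names, array shapes, and flat (non-hierarchical) structure are hypothetical placeholders; the paper's actual hierarchy and region definitions may differ.

```python
# Sketch of a location-based train/test split: entire regions are held out,
# so generalization is measured on geographic areas unseen during training.
# Region names and array shapes are hypothetical placeholders.
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical dataset: one low-res predictor / high-res target pair per region.
regions = ["amazon", "sahel", "ganges", "alps", "rockies", "outback"]
data = {r: (rng.normal(size=(64, 64)), rng.normal(size=(640, 640)))
        for r in regions}

# Hold out whole regions rather than random samples within regions.
test_regions = {"alps", "outback"}
train = {r: d for r, d in data.items() if r not in test_regions}
test = {r: d for r, d in data.items() if r in test_regions}

print("train regions:", sorted(train), "| test regions:", sorted(test))
```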


From One Attack Domain to Another: Contrastive Transfer Learning with Siamese Networks for APT Detection

Benabderrahmane, Sidahmed, Rahwan, Talal

arXiv.org Artificial Intelligence

Advanced Persistent Threats (APTs) pose a major cybersecurity challenge due to their stealth, persistence, and adaptability. Traditional machine learning detectors struggle with class imbalance, high-dimensional features, and scarce real-world traces. They often lack transferability: they perform well in the training domain but degrade in novel attack scenarios. We propose a hybrid transfer framework that integrates Transfer Learning, Explainable AI (XAI), contrastive learning, and Siamese networks to improve cross-domain generalization. An attention-based autoencoder supports knowledge transfer across domains, while Shapley Additive exPlanations (SHAP) select stable, informative features to reduce dimensionality and computational cost. A Siamese encoder trained with a contrastive objective aligns source and target representations, increasing anomaly separability and mitigating feature drift. We evaluate on real-world traces from the DARPA Transparent Computing (TC) program, augmented with synthetic attack scenarios to test robustness. Across source-to-target transfers, the approach delivers improved detection scores over classical and deep baselines, demonstrating a scalable, explainable, and transferable solution for APT detection.
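
To make the Siamese/contrastive component concrete, here is a minimal PyTorch sketch of a shared encoder trained with a margin-based contrastive loss, pulling pairs that should align together and pushing mismatched pairs apart. The layer sizes, margin, and random data are illustrative assumptions; the paper's full pipeline additionally involves an attention-based autoencoder and SHAP-selected features upstream.

```python
# Minimal sketch of a Siamese encoder with a margin-based contrastive loss.
# Dimensions, margin, and data are illustrative; the paper additionally uses
# an attention-based autoencoder and SHAP feature selection upstream.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SiameseEncoder(nn.Module):
    def __init__(self, in_dim: int, emb_dim: int = 32):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(in_dim, 64), nn.ReLU(),
            nn.Linear(64, emb_dim),
        )

    def forward(self, x_a, x_b):
        # The same weights encode both branches (weight sharing = "Siamese").
        return self.net(x_a), self.net(x_b)

def contrastive_loss(z_a, z_b, same: torch.Tensor, margin: float = 1.0):
    # same = 1 for pairs that should align (e.g., benign/benign across domains),
    # same = 0 for pairs that should be pushed at least `margin` apart.
    d = F.pairwise_distance(z_a, z_b)
    return (same * d.pow(2) + (1 - same) * F.relu(margin - d).pow(2)).mean()

# Toy training step on random source/target feature pairs.
enc = SiameseEncoder(in_dim=20)
opt = torch.optim.Adam(enc.parameters(), lr=1e-3)
x_src, x_tgt = torch.randn(16, 20), torch.randn(16, 20)
same = torch.randint(0, 2, (16,)).float()

z_a, z_b = enc(x_src, x_tgt)
loss = contrastive_loss(z_a, z_b, same)
opt.zero_grad()
loss.backward()
opt.step()
print(f"contrastive loss: {loss.item():.4f}")
```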


BNLI: A Linguistically-Refined Bengali Dataset for Natural Language Inference

Haque, Farah Binta, Yasin, Md, Saha, Shishir, Rafi, Md Shoaib Akhter, Sadeque, Farig

arXiv.org Artificial Intelligence

Despite the growing progress in Natural Language Inference (NLI) research, resources for the Bengali language remain extremely limited. Existing Bengali NLI datasets exhibit several inconsistencies, including annotation errors, ambiguous sentence pairs, and inadequate linguistic diversity, which hinder effective model training and evaluation. To address these limitations, we introduce BNLI, a refined and linguistically curated Bengali NLI dataset designed to support robust language understanding and inference modeling. The dataset was constructed through a rigorous annotation pipeline emphasizing semantic clarity and balance across entailment, contradiction, and neutrality classes. We benchmarked BNLI using a suite of state-of-the-art transformer-based architectures, including multilingual and Bengali-specific models, to assess their ability to capture complex semantic relations in Bengali text. The experimental findings highlight the improved reliability and interpretability achieved with BNLI, establishing it as a strong foundation for advancing research in Bengali and other low-resource language inference tasks.
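
A typical way to benchmark a pair from such a dataset with a multilingual transformer is sketched below. The checkpoint name and label mapping are assumptions for illustration (consult the model card when reproducing any benchmark), and the Bengali sentences are rough glosses, not BNLI items.

```python
# Sketch of scoring one premise/hypothesis pair with a multilingual NLI model.
# The checkpoint name and label order are assumptions; consult the model card
# for the actual mapping when reproducing any benchmark.
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

model_name = "joeddav/xlm-roberta-large-xnli"  # assumed example checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name)

premise = "আকাশ আজ পরিষ্কার।"      # rough gloss: "The sky is clear today."
hypothesis = "আজ বৃষ্টি হচ্ছে।"     # rough gloss: "It is raining today."

inputs = tokenizer(premise, hypothesis, return_tensors="pt", truncation=True)
with torch.no_grad():
    logits = model(**inputs).logits
probs = logits.softmax(dim=-1).squeeze()

for label_id, label in model.config.id2label.items():
    print(f"{label}: {probs[label_id]:.3f}")
```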


Evaluating Machine Translation Datasets for Low-Web Data Languages: A Gendered Lens

Nigatu, Hellina Hailu, Mamo, Bethelhem Yemane, Balcha, Bontu Fufa, Tesfaye, Debora Taye, Zewdie, Elbethel Daniel, Nesiru, Ikram Behiru, Hailu, Jitu Ewnetu, Yayo, Senait Mengesha

arXiv.org Artificial Intelligence

As low-resourced languages are increasingly incorporated into NLP research, there is an emphasis on collecting large-scale datasets. But in prioritizing quantity over quality, we risk 1) building language technologies that perform poorly for these languages and 2) producing harmful content that perpetuates societal biases. In this paper, we investigate the quality of Machine Translation (MT) datasets for three low-resourced languages (Afan Oromo, Amharic, and Tigrinya), with a focus on gender representation in the datasets. Our findings demonstrate that while the training data has a large representation of political and religious domain text, benchmark datasets are focused on news, health, and sports. We also found a large skew towards the male gender: in the names of persons, in the grammatical gender of verbs, and in stereotypical depictions in the datasets. Further, we found harmful and toxic depictions of women, which were more prominent for the language with the largest amount of data, underscoring that quantity does not guarantee quality. We hope that our work inspires further inquiry into the datasets collected for low-resourced languages and prompts early mitigation of harmful content. WARNING: This paper contains discussion of NSFW content that some may find disturbing.
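
As a simplified illustration of how such a gender-representation audit can be run, the sketch below counts gendered terms in a corpus sample. The word lists are tiny hypothetical placeholders; a real audit like the paper's needs curated, language-specific lexicons and morphological analysis for grammatical gender.

```python
# Toy sketch of auditing gender skew in a corpus sample by counting gendered
# terms. The word lists are hypothetical placeholders; a real audit requires
# curated, language-specific lexicons and morphological analysis.
import re
from collections import Counter

MALE = {"he", "him", "his", "man", "men"}
FEMALE = {"she", "her", "hers", "woman", "women"}

def gender_counts(sentences):
    counts = Counter()
    for s in sentences:
        for tok in re.findall(r"[a-z']+", s.lower()):
            if tok in MALE:
                counts["male"] += 1
            elif tok in FEMALE:
                counts["female"] += 1
    return counts

corpus = [
    "He said the minister will speak to his supporters.",
    "The doctor finished his shift before the nurse began hers.",
    "She reported the story from the capital.",
]
print(gender_counts(corpus))  # Counter({'male': 3, 'female': 2})
```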